Robust differential abundance test in compositional data

نویسندگان

چکیده

Summary Differential abundance tests for compositional data are essential and fundamental in various biomedical applications, such as single-cell, bulk RNA-seq microbiome analysis. However, because of the constraint prevalence zero counts data, differential analysis on remains a complicated unsolved statistical problem. This article proposes new test, robust to address these challenges. Compared with existing methods, test is simple computationally efficient, prevalent datasets, can take data’s nature into account, has theoretical guarantee controlling false discoveries general setting. Furthermore, presence observed covariates, work covariate-balancing techniques remove potential confounding effects draw reliable conclusions. The proposed applied several numerical examples, its merits demonstrated using both simulated real datasets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust statistical methods for differential abundance analysis of metagenomics data

This document outlines my 2011-2012 AMSC project for the 663/664 course series and in particular the mid-year progress. It is an ever evolving document. The project is to develop Metastats 2.0, a software package analyzing metagenomic data. We propose two major extensions and modifications to the Metastats software and the underlying statistical methods. The first extension of Metastats is a mi...

متن کامل

Robust Methods for Compositional Data

Abstract. Many practical data sets in environmental sciences, official statistics and various other disciplines are in fact compositional data because only the ratios between the variables are informative. Compositional data are represented in the Aitchison geometry on the simplex, and for applying statistical methods designed for the Euclidean geometry they need to be transformed first. The is...

متن کامل

Robust factor analysis for compositional data

Factor analysis as a dimension reduction technique is widely used with compositional data. Using the method for raw data or for improperly transformed data will, however, lead to biased results and consequently to misleading interpretations. Although some procedures, suitable for factor analysis with compositional data, were already developed, they require pre-knowledge of variable groups, or a...

متن کامل

Robust Compositional Polarity Classification

We describe a pattern-based system for polarity classification from texts. Our system is currently restricted to the positive, negative or neutral polarity of phrases and sentences. It analyses the input texts with the aid of a polarity lexicon that specifies the prior polarity of words. A chunker is used to determine phrases that are the basis for a compositional treatment of phrase-level pola...

متن کامل

Discriminant analysis for compositional data and robust parameter estimation

Abstract Compositional data, i.e. data including only relative information, need to be transformed prior to applying the standard discriminant analysis methods that are designed for the Euclidean space. Here it is investigated for linear, quadratic, and Fisher discriminant analysis, which of the transformations lead to invariance of the resulting discriminant rules. Moreover, it is shown that f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Biometrika

سال: 2022

ISSN: ['0006-3444', '1464-3510']

DOI: https://doi.org/10.1093/biomet/asac029